A multinomial logistic mixed model for the prediction of categorical spatial data
نویسندگان
چکیده
This article may be used for research, teaching, and private study purposes. Any substantial or systematic reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any form to anyone is expressly forbidden. The publisher does not give any warranty express or implied or make any representation that the contents will be complete or accurate or up to date. The accuracy of any instructions, formulae, and drug doses should be independently verified with primary sources. The publisher shall not be liable for any loss, actions, claims, proceedings, demand, or costs or damages whatsoever or howsoever caused arising directly or indirectly in connection with or arising out of the use of this material. In this article, the prediction problem of categorical spatial data, that is, the estimation of class occurrence probability for (target) locations with unknown class labels given observed class labels at sample (source) locations, is analyzed in the framework of generalized linear mixed models, where intermediate, latent (unobservable) spatially correlated Gaussian variables (random effects) are assumed for the observable non-Gaussian responses to account for spatial dependence information. Within such a framework, a spatial multinomial logistic mixed model is proposed specifically to model categorical spatial data. Analogous to the dual form of kriging family, the proposed model is represented as a multinomial logistic function of spatial covariances between target and source locations. The associated inference problems, such as estimation of parameters and choice of the spatial covariance function for latent variables, and the connection of the proposed model with other methods, such as the indicator variants of the kriging family (indicator kriging and indicator cokriging) and Bayesian maximum entropy, are discussed in detail. The advantages and properties of the proposed method are illustrated via synthetic and real case studies. 1. Introduction Categorical spatial data, such as land use classes, vegetation species types, or categories of socioeconomic status, are all important information sources in spatial analysis, resource management, decision support systems, and planning in general. Such data types, which can be nominal, ordinal, or interval, exhibit spatial or spatiotemporal patterns with sharp boundaries and complex geometrical characteristics. This discrete (non-Gaussian) nature limits the applications of successful statistical methods that have been widely used for continuous variables including the celebrated kriging family of methods (Chilès and Delfiner 1999). A key task in statistical modeling of categorical spatial data is to estimate the joint probability mass function of a set of geo-referenced (spatially …
منابع مشابه
Multinomial logit - Wikipedia, the free encyclopedia
In statistics, a multinomial logit (MNL) model, also known as multinomial logistic regression, is a regression model which generalizes logistic regression by allowing more than two discrete outcomes.[1] That is, it is a model that is used to predict the probabilities of the different possible outcomes of a categorically distributed dependent variable, given a set of independent variables (which...
متن کاملDeterminants of Inflation in Selected Countries
This paper focuses on developing models to study influential factors on the inflation rate for a panel of available countries in the World Bank data base during 2008-2012. For this purpose, Random effect log-linear and Ordinal logistic models are used for the analysis of continuous and categorical inflation rate variables. As the original inflation rate response to variables shows an appar...
متن کاملExpert networks with mixed continuous and categorical feature variables: a location modeling approach
In the context of medically relevant artificial intelligence, many real-world problems involve both continuous and categorical feature variables. When the data are mixed mode, the assumption of multivariate Gaussian distributions for the gating network of normalized Gaussian (NG) expert networks, such as NG mixture of experts (NGME), becomes invalid. An independence model has been studied to ha...
متن کاملBayesian Inference for Poisson and Multinomial Log-linear Models
Categorical data frequently arise in applications in the social sciences. In such applications,the class of log-linear models, based on either a Poisson or (product) multinomial response distribution, is a flexible model class for inference and prediction. In this paper we consider the Bayesian analysis of both Poisson and multinomial log-linear models. It is often convenient to model multinomi...
متن کاملA Stick-Breaking Likelihood for Categorical Data Analysis with Latent Gaussian Models
The development of accurate models and efficient algorithms for the analysis of multivariate categorical data are important and longstanding problems in machine learning and computational statistics. In this paper, we focus on modeling categorical data using Latent Gaussian Models (LGMs). We propose a novel logistic stick-breaking likelihood function for categorical LGMs that can exploit recent...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- International Journal of Geographical Information Science
دوره 25 شماره
صفحات -
تاریخ انتشار 2011